Modern Homology Search
نویسندگان
چکیده
Dynamic programming [1] has full sensitivity, but too slow for large scale homology search. FASTA / BLAST type of heuristics [2] trade sensitivity for speed. Can we have both sensitivity and speed? We present the mathematical theory of optimized spaced seeds which allows modern homology search to achieve high sensitivity and high speed simultaneously. The spaced seed methodology is implemented in our PatternHunter software [3, 4], as well as many other modern homology search software, serving thousands of queries daily. The theory is then extended and implemented in ZOOM [5] to do fast genome scale reads mapping for the second generation sequencers. Joint work with Bin Ma, John Tromp, D. Kisman, Hao Lin, and Zefeng Zhang. References[1] S.F. Altschul, W. Gish, W. Miller W, E.W. Myers, D.J. Lipman. Basic local alignmentsearch tool. J Mol Biol 215:3(1990), 403–410.[2] T.F. Smith, M.S. Waterman, Identification of Common Molecular Subsequences.Journal of Molecular Biology , 147(1981), 195–197.[3] B. Ma, J. Tromp, M. Li, PatternHunter: Faster and more sensitive homology search.Bioinformatics, 18:3(2002), 440–445.[4] M. Li, B. Ma, D. Kisman and J. Tromp. PatternHunter II: highly sensitive and fasthomology search. J. Bioinformatics and Computational Biology , 2:3(2004), 417–440.[5] H. Lin, Z. Zhang, M.Q. Zhang, B. Ma, M. Li. ZOOM! Zillions of oligos mapped.Bioinformatics. In press. 2008.
منابع مشابه
Modern BLAST Programs
The Basic Local Alignment Search Tool (BLAST) is arguably the most widely used program in bioinformatics. By sacrificing sensitivity for speed, it makes sequence comparison practical on huge sequence databases currently available. The original version of BLAST was developed in 1990. Since then it has spawned a variant of specialized programs. This chapter surveys the development of BLAST and BL...
متن کاملFast Homology Search using Categorisation Profiles
Homology search is an important step in discovering evolutionary relationships in modern molecular biology. In particular, it is key to analysing data from large-scale sequencing initiatives: by establishing homology between a newly-discovered sequence and well-understood sequences in curated repositories, it is possible to infer structure and function without the need for costly, time-consumin...
متن کاملNew Algorithm Transgenic with Homology to solve the problem of OneMax
Already proposed a new vision, a heuristic based on a modern branch of biology called Biotechnology. "This is any technological application that uses biological systems, living organisms, or derivatives thereof, to make or modify products or processes for specific use" [SCDB, 1992]. For individuals who have undergone any change in their genes through biotechnology techniques are known as Transg...
متن کاملExperimental search for minimal organisms and the last universal common ancestor: Reconstructing the Ur-Organism
T wo questions that should be closely related have historically been studied with very different approaches. One is what constitutes a minimal living system, whether minimal cell or minimal self-contained ecosystem. The other is what actual system was the last universal common ancestor (LUCA) of all modern cells. As the LUCA is supposed to have been a bottleneck through which all life passed be...
متن کاملComputational Identification of Micro RNAs and Their Transcript Target(s) in Field Mustard (Brassica rapa L.)
Background: Micro RNAs (miRNAs) are a pivotal part of non-protein-coding endogenous small RNA molecules that regulate the genes involved in plant growth and development, and respond to biotic and abiotic environmental stresses posttranscriptionally.Objective: In the present study, we report the results of a systemic search for identifi cation of new miRNAs in B. rapa using homology-based ...
متن کامل